Search CORE

31 research outputs found

Doctor of Philosophy

Author: Fu Zhisong
Publication venue: University of Utah
Publication date: 01/12/2013
Field of study

dissertationPartial differential equations (PDEs) are widely used in science and engineering to model phenomena such as sound, heat, and electrostatics. In many practical science and engineering applications, the solutions of PDEs require the tessellation of computational domains into unstructured meshes and entail computationally expensive and time-consuming processes. Therefore, efficient and fast PDE solving techniques on unstructured meshes are important in these applications. Relative to CPUs, the faster growth curves in the speed and greater power efficiency of the SIMD streaming processors, such as GPUs, have gained them an increasingly important role in the high-performance computing area. Combining suitable parallel algorithms and these streaming processors, we can develop very efficient numerical solvers of PDEs. The contributions of this dissertation are twofold: proposal of two general strategies to design efficient PDE solvers on GPUs and the specific applications of these strategies to solve different types of PDEs. Specifically, this dissertation consists of four parts. First, we describe the general strategies, the domain decomposition strategy and the hybrid gathering strategy. Next, we introduce a parallel algorithm for solving the eikonal equation on fully unstructured meshes efficiently. Third, we present the algorithms and data structures necessary to move the entire FEM pipeline to the GPU. Fourth, we propose a parallel algorithm for solving the levelset equation on fully unstructured 2D or 3D meshes or manifolds. This algorithm combines a narrowband scheme with domain decomposition for efficient levelset equation solving

The University of Utah: J. Willard Marriott Digital Library

Parallel breadth first search on GPU clusters

Author: Berzins Martin
Fu Zhisong
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2014
Field of study

pre-printFast, scalable, low-cost, and low-power execution of parallel graph algorithms is important for a wide variety of commercial and public sector applications. Breadth First Search (BFS) imposes an extreme burden on memory bandwidth and network communications and has been proposed as a benchmark that may be used to evaluate current and future parallel computers. Hardware trends and manufacturing limits strongly imply that many-core devices, such as NVIDIA® GPUs and the Intel ® Xeon Phi ® , will become central components of such future systems. GPUs are well known to deliver the highest FLOPS/watt and enjoy a very significant memory bandwidth advantage over CPU architectures. Recent work has demonstrated that GPUs can deliver high performance for parallel graph algorithms and, further, that it is possible to encapsulate that capability in a manner that hides the low level details of the GPU architecture and the CUDA language but preserves the high throughput of the GPU. We extend previous research on GPUs and on scalable graph processing on supercomputers and demonstrate that a high-performance parallel graph machine can be created using commodity GPUs and networking hardware

The University of Utah: J. Willard Marriott Digital Library

A fast iterative method for solving the eikonal equation on triangulated surfaces

Author: Fu Zhisong
Whitaker Ross T.
Publication venue: University of Utah
Publication date: 01/01/2012
Field of study

poste

The University of Utah: J. Willard Marriott Digital Library

The Photosynthetic Characteristics of \u3cem\u3eHemarthria compressa\u3c/em\u3e in Different Seasons

Author: Chen Lingzhi
Fu Xiantao
Huang Huijun
Tang Zhisong
Yang Chunhua
Publication venue: UKnowledge
Publication date: 05/07/2020
Field of study

University of Kentucky

Overseeding Whipgrass with Cool‐Season Annuals to Increase Yield and Quality in a Hay Field in Southwest China

Author: Chen Lingzhi
Fu Xiantao
Hang Huijun
Tang Zhisong
Yang Chunhua
Publication venue: UKnowledge
Publication date: 08/02/2021
Field of study

University of Kentucky

Yield and Quality of Whipgrass Mixed with Different Levels of White Clover

Author: Chen Lingzhi
Fu Xiantao
Huang Huijun
Tang Zhisong
Yang Chunhua
Publication venue: UKnowledge
Publication date: 16/07/2020
Field of study

University of Kentucky

Overseeding Whipgrass with Cool‐Season Annuals to Increase Pasture Yield and Quality in Southwest China

Author: Chen Lingzhi
Fu Xiantao
Hang Huijun
Tang Zhisong
Yang Chunhua
Publication venue: UKnowledge
Publication date: 20/06/2021
Field of study

University of Kentucky

A FAST ITERATIVE METHOD FOR SOLVING THE EIKONAL EQUATION ON TRIANGULATED SURFACES

Author: Fu Zhisong
Jeong Won-Ki
Kirby Robert M.
Pan Yongsheng
Whitaker Ross T.
Publication venue: 'Society for Industrial & Applied Mathematics (SIAM)'
Publication date: 19/08/2014
Field of study

This paper presents an efficient, fine-grained parallel algorithm for solving the Eikonal equation on triangular meshes. The Eikonal equation, and the broader class of Hamilton-Jacobi equations to which it belongs, have a wide range of applications from geometric optics and seismology to biological modeling and analysis of geometry and images. The ability to solve such equations accurately and efficiently provides new capabilities for exploring and visualizing parameter spaces and for solving inverse problems that rely on such equations in the forward model. Efficient solvers on state-of-the-art, parallel architectures require new algorithms that are not, in many cases, optimal, but are better suited to synchronous updates of the solution. In previous work [W. K. Jeong and R. T. Whitaker, SIAM J. Sci. Comput., 30 (2008), pp. 2512-2534], the authors proposed the fast iterative method (FIM) to efficiently solve the Eikonal equation on regular grids. In this paper we extend the fast iterative method to solve Eikonal equations efficiently on triangulated domains on the CPU and on parallel architectures, including graphics processors. We propose a new local update scheme that provides solutions of first-order accuracy for both architectures. We also propose a novel triangle-based update scheme and its corresponding data structure for efficient irregular data mapping to parallel single-instruction multiple-data (SIMD) processors. We provide detailed descriptions of the implementations on a single CPU, a multicore CPU with shared memory, and SIMD architectures with comparative results against state-of-the-art Eikonal solvers.open4

Crossref

ScholarWorks@UNIST